Efficient Bayesian Nonparametric Methods for Model-Free Reinforcement Learning in Centralized and Decentralized Sequential Environments
Author
Abstract
By Miao Liu, Department of Electrical and Computer Engineering, Duke University
Similar resources
Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning
Developing a safe and efficient collision avoidance policy for multiple robots is challenging in decentralized scenarios where each robot generates its paths without observing other robots’ states and intents. While other distributed multi-robot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computation...
A Self-organizing Multi-agent System for Adaptive Continuous Unsupervised Learning in Complex Uncertain Environments
Introduction. Continuous learning and online decision-making in complex dynamic environments under conditions of uncertainty and limited computational resources represent one of the most challenging problems for developing robust intelligent systems. The existing task of unsupervised clustering in statistical learning requires maximizing (or minimizing) a certain similarity-based objectiv...
Bayesian Reinforcement Learning with Gaussian Process Temporal Difference Methods
Reinforcement Learning is a class of problems frequently encountered by both biological and artificial agents. An important algorithmic component of many Reinforcement Learning solution methods is the estimation of state or state-action values of a fixed policy controlling a Markov decision process (MDP), a task known as policy evaluation. We present a novel Bayesian approach to policy evaluati...
Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions
We present a nonparametric Bayesian approach to inverse reinforcement learning (IRL) for multiple reward functions. Most previous IRL algorithms assume that the behaviour data is obtained from an agent who is optimizing a single reward function, but this assumption is hard to guarantee in practice. Our approach is based on integrating the Dirichlet process mixture model into Bayesian IRL. We pr...
Transfer Learning for Reinforcement Learning with Dependent Dirichlet Process and Gaussian Process
The ability to transfer knowledge across tasks is important in guaranteeing the performance of lifelong learning in autonomous agents. We propose a flexible Bayesian Nonparametric (BNP) model-based architecture for transferring knowledge between reinforcement learning domains. A Dependent Dirichlet Process Gaussian Process hierarchical BNP model is used to cluster different classes of source MDP...